A Character Recognizer for Turkish Language

نویسندگان

  • Sait Ulas Korkmaz
  • G. Kirçiçegi
  • Y. Akinci
  • Volkan Atalay
چکیده

This paper presents particularly a contextual post processing subsystem for a Turkish machine printed character recognition system. The contextual post processing subsystem is based on positional binary 3gram statistics for Turkish language, an error corrector parser and a lexicon, which contains root words and the inflected forms of the root words. Error corrector parser is used for correcting CR alternatives using Turkish Morphology.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Language of General Fuzzy Recognizer

In this note first by considering the notion of general fuzzy automata (for simplicity GFA), we define the notions of direct product, restricted direct product and join of two GFA. Also, we introduce some operations on (Fuzzy) sets and then prove some related theorems. Finally we construct the general fuzzy recognizers and recognizable sets and give the notion of (trim) reversal of a given GFA....

متن کامل

Properties of a hand-printed Chinese character recognizer based on contextual vector quantization

A hand-printed Chinese character recognizer based on Contextual Vector Quantization (CVQ) has been built previously. In this paper, several properties of the recognizer will be discussed and the recognizer of 4516 Chinese characters has a successful rate of 91.0%. Then the output of the recognizer is passed to a language model which when applied to recognize a passage of about 1200 characters r...

متن کامل

Turkish handwritten text recognition: a case of agglutinative languages

We describe a system for recognizing unconstrained Turkish handwritten text. Turkish has agglutinative morphology and theoretically an infinite number of words that can be generated by adding more suffixes to the word. This makes lexicon-based recognition approaches, where the most likely word is selected among all the alternatives in a lexicon, unsuitable for Turkish. We describe our approach ...

متن کامل

ON GENERAL FUZZY RECOGNIZERS

In this paper, we de ne the concepts of general fuzzy recognizer, language recognized by a general fuzzy recognizer, the accessible and the coac- cessible parts of a general fuzzy recognizer and the reversal of a general fuzzy recognizer. Then we obtain the relationships between them and construct a topology and some hypergroups on a general fuzzy recognizer.

متن کامل

Hand-Written Chinese Character Recognizer

A n off-line hand-written Chinese character recognizer based on Contextual Vector Quantization (CVQ) supporting a vocabulary of 4,616 Chinese characters, alphanumerics and punctuation symbols has been reported. Trained with a sample for each character from each of 100 writers and tested on texts of 160,000 characters written b y another 200 writers, the average recognition rate is 77.2%. Two st...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003